Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 19382 |
| Missing cells | 2086 |
| Missing cells (%) | 0.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.2 MiB |
| Average record size in memory | 388.7 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 3 |
| Text | 1 |
# is highly overall correlated with claim_status and 5 other fields | High correlation |
claim_status is highly overall correlated with # and 4 other fields | High correlation |
video_comment_count is highly overall correlated with # and 4 other fields | High correlation |
video_download_count is highly overall correlated with # and 5 other fields | High correlation |
video_like_count is highly overall correlated with # and 5 other fields | High correlation |
video_share_count is highly overall correlated with # and 5 other fields | High correlation |
video_view_count is highly overall correlated with # and 5 other fields | High correlation |
verified_status is highly imbalanced (65.7%) | Imbalance |
claim_status has 298 (1.5%) missing values | Missing |
video_transcription_text has 298 (1.5%) missing values | Missing |
video_view_count has 298 (1.5%) missing values | Missing |
video_like_count has 298 (1.5%) missing values | Missing |
video_share_count has 298 (1.5%) missing values | Missing |
video_download_count has 298 (1.5%) missing values | Missing |
video_comment_count has 298 (1.5%) missing values | Missing |
# is uniformly distributed | Uniform |
# has unique values | Unique |
video_id has unique values | Unique |
video_download_count has 977 (5.0%) zeros | Zeros |
video_comment_count has 3434 (17.7%) zeros | Zeros |
Reproduction
| Analysis started | 2024-09-04 04:02:03.417027 |
|---|---|
| Analysis finished | 2024-09-04 04:02:12.357873 |
| Duration | 8.94 seconds |
| Software version | ydata-profiling vv4.9.0 |
| Download configuration | config.json |
#
Real number (ℝ)
HIGH CORRELATION  UNIFORM  UNIQUE 
| Distinct | 19382 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9691.5 |
| Minimum | 1 |
|---|---|
| Maximum | 19382 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 151.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 970.05 |
| Q1 | 4846.25 |
| median | 9691.5 |
| Q3 | 14536.75 |
| 95-th percentile | 18412.95 |
| Maximum | 19382 |
| Range | 19381 |
| Interquartile range (IQR) | 9690.5 |
Descriptive statistics
| Standard deviation | 5595.2458 |
|---|---|
| Coefficient of variation (CV) | 0.57733538 |
| Kurtosis | -1.2 |
| Mean | 9691.5 |
| Median Absolute Deviation (MAD) | 4845.5 |
| Skewness | 0 |
| Sum | 1.8784065 × 108 |
| Variance | 31306776 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 12928 | 1 | < 0.1% |
| 12926 | 1 | < 0.1% |
| 12925 | 1 | < 0.1% |
| 12924 | 1 | < 0.1% |
| 12923 | 1 | < 0.1% |
| 12922 | 1 | < 0.1% |
| 12921 | 1 | < 0.1% |
| 12920 | 1 | < 0.1% |
| 12919 | 1 | < 0.1% |
| Other values (19372) | 19372 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 19382 | 1 | |
| 19381 | 1 | |
| 19380 | 1 | |
| 19379 | 1 | |
| 19378 | 1 | |
| 19377 | 1 | |
| 19376 | 1 | |
| 19375 | 1 | |
| 19374 | 1 | |
| 19373 | 1 |
claim_status
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 298 |
| Missing (%) | 1.5% |
| Memory size | 1.0 MiB |
| claim | |
|---|---|
| opinion |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.9930832 |
| Min length | 5 |
Characters and Unicode
| Total characters | 114372 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | claim |
|---|---|
| 2nd row | claim |
| 3rd row | claim |
| 4th row | claim |
| 5th row | claim |
Common Values
| Value | Count | Frequency (%) |
| claim | 9608 | |
| opinion | 9476 | |
| (Missing) | 298 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| claim | 9608 | |
| opinion | 9476 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 28560 | |
| o | 18952 | |
| n | 18952 | |
| c | 9608 | 8.4% |
| l | 9608 | 8.4% |
| a | 9608 | 8.4% |
| m | 9608 | 8.4% |
| p | 9476 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 114372 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 28560 | |
| o | 18952 | |
| n | 18952 | |
| c | 9608 | 8.4% |
| l | 9608 | 8.4% |
| a | 9608 | 8.4% |
| m | 9608 | 8.4% |
| p | 9476 | 8.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 114372 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 28560 | |
| o | 18952 | |
| n | 18952 | |
| c | 9608 | 8.4% |
| l | 9608 | 8.4% |
| a | 9608 | 8.4% |
| m | 9608 | 8.4% |
| p | 9476 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 114372 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 28560 | |
| o | 18952 | |
| n | 18952 | |
| c | 9608 | 8.4% |
| l | 9608 | 8.4% |
| a | 9608 | 8.4% |
| m | 9608 | 8.4% |
| p | 9476 | 8.3% |
video_id
Real number (ℝ)
UNIQUE 
| Distinct | 19382 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.6274541 × 109 |
| Minimum | 1.234959 × 109 |
|---|---|
| Maximum | 9.9998731 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 151.6 KiB |
Quantile statistics
| Minimum | 1.234959 × 109 |
|---|---|
| 5-th percentile | 1.6658012 × 109 |
| Q1 | 3.4304168 × 109 |
| median | 5.6186636 × 109 |
| Q3 | 7.8439602 × 109 |
| 95-th percentile | 9.5670758 × 109 |
| Maximum | 9.9998731 × 109 |
| Range | 8.7649141 × 109 |
| Interquartile range (IQR) | 4.4135434 × 109 |
Descriptive statistics
| Standard deviation | 2.5364405 × 109 |
|---|---|
| Coefficient of variation (CV) | 0.45072611 |
| Kurtosis | -1.2013772 |
| Mean | 5.6274541 × 109 |
| Median Absolute Deviation (MAD) | 2.2071921 × 109 |
| Skewness | 0.0037792107 |
| Sum | 1.0907131 × 1014 |
| Variance | 6.4335302 × 1018 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7017666017 | 1 | < 0.1% |
| 2453126010 | 1 | < 0.1% |
| 3115436537 | 1 | < 0.1% |
| 9728731936 | 1 | < 0.1% |
| 7774044518 | 1 | < 0.1% |
| 2955909586 | 1 | < 0.1% |
| 2869288216 | 1 | < 0.1% |
| 1753787560 | 1 | < 0.1% |
| 5781343321 | 1 | < 0.1% |
| 4009733168 | 1 | < 0.1% |
| Other values (19372) | 19372 |
| Value | Count | Frequency (%) |
| 1234959018 | 1 | |
| 1235937767 | 1 | |
| 1236284548 | 1 | |
| 1236594147 | 1 | |
| 1237008133 | 1 | |
| 1238480508 | 1 | |
| 1238937260 | 1 | |
| 1239698040 | 1 | |
| 1240091597 | 1 | |
| 1240349530 | 1 |
| Value | Count | Frequency (%) |
| 9999873075 | 1 | |
| 9999834973 | 1 | |
| 9999715467 | 1 | |
| 9999298421 | 1 | |
| 9999160062 | 1 | |
| 9998453278 | 1 | |
| 9998375429 | 1 | |
| 9997268707 | 1 | |
| 9997060902 | 1 | |
| 9996154488 | 1 |
video_duration_sec
Real number (ℝ)
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.421732 |
| Minimum | 5 |
|---|---|
| Maximum | 60 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 151.6 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 18 |
| median | 32 |
| Q3 | 47 |
| 95-th percentile | 58 |
| Maximum | 60 |
| Range | 55 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 16.229967 |
|---|---|
| Coefficient of variation (CV) | 0.50058916 |
| Kurtosis | -1.2108439 |
| Mean | 32.421732 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 0.0034523581 |
| Sum | 628398 |
| Variance | 263.41183 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 380 | 2.0% |
| 6 | 378 | 2.0% |
| 57 | 376 | 1.9% |
| 8 | 373 | 1.9% |
| 16 | 370 | 1.9% |
| 34 | 370 | 1.9% |
| 26 | 369 | 1.9% |
| 15 | 367 | 1.9% |
| 52 | 365 | 1.9% |
| 10 | 365 | 1.9% |
| Other values (46) | 15669 |
| Value | Count | Frequency (%) |
| 5 | 337 | |
| 6 | 378 | |
| 7 | 359 | |
| 8 | 373 | |
| 9 | 319 | |
| 10 | 365 | |
| 11 | 340 | |
| 12 | 345 | |
| 13 | 352 | |
| 14 | 357 |
| Value | Count | Frequency (%) |
| 60 | 349 | |
| 59 | 323 | |
| 58 | 355 | |
| 57 | 376 | |
| 56 | 343 | |
| 55 | 332 | |
| 54 | 353 | |
| 53 | 342 | |
| 52 | 365 | |
| 51 | 343 |
MISSING 
| Distinct | 19012 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 298 |
| Missing (%) | 1.5% |
| Memory size | 2.8 MiB |
Length
| Max length | 182 |
|---|---|
| Median length | 153 |
| Mean length | 89.093534 |
| Min length | 31 |
Characters and Unicode
| Total characters | 1700261 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 18940 ? |
|---|---|
| Unique (%) | 99.2% |
Sample
| 1st row | someone shared with me that drone deliveries are already happening and will become common by 2025 |
|---|---|
| 2nd row | someone shared with me that there are more microorganisms in one teaspoon of soil than people on the planet |
| 3rd row | someone shared with me that american industrialist andrew carnegie had a net worth of $475 million usd, worth over $300 billion usd today |
| 4th row | someone shared with me that the metro of st. petersburg, with an average depth of hundred meters, is the deepest metro in the world |
| 5th row | someone shared with me that the number of businesses allowing employees to bring pets to the workplace has grown by 6% worldwide |
| Value | Count | Frequency (%) |
| that | 19188 | 6.3% |
| the | 19074 | 6.2% |
| a | 15219 | 5.0% |
| is | 10469 | 3.4% |
| in | 7863 | 2.6% |
| my | 7299 | 2.4% |
| of | 6670 | 2.2% |
| on | 5551 | 1.8% |
| to | 4169 | 1.4% |
| are | 3980 | 1.3% |
| Other values (1316) | 207028 |
Most occurring characters
| Value | Count | Frequency (%) |
| 293662 | ||
| e | 172706 | 10.2% |
| a | 133223 | 7.8% |
| t | 127604 | 7.5% |
| i | 103191 | 6.1% |
| n | 99064 | 5.8% |
| s | 94411 | 5.6% |
| o | 91798 | 5.4% |
| r | 83088 | 4.9% |
| l | 67749 | 4.0% |
| Other values (42) | 433765 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1700261 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 293662 | ||
| e | 172706 | 10.2% |
| a | 133223 | 7.8% |
| t | 127604 | 7.5% |
| i | 103191 | 6.1% |
| n | 99064 | 5.8% |
| s | 94411 | 5.6% |
| o | 91798 | 5.4% |
| r | 83088 | 4.9% |
| l | 67749 | 4.0% |
| Other values (42) | 433765 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1700261 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 293662 | ||
| e | 172706 | 10.2% |
| a | 133223 | 7.8% |
| t | 127604 | 7.5% |
| i | 103191 | 6.1% |
| n | 99064 | 5.8% |
| s | 94411 | 5.6% |
| o | 91798 | 5.4% |
| r | 83088 | 4.9% |
| l | 67749 | 4.0% |
| Other values (42) | 433765 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1700261 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 293662 | ||
| e | 172706 | 10.2% |
| a | 133223 | 7.8% |
| t | 127604 | 7.5% |
| i | 103191 | 6.1% |
| n | 99064 | 5.8% |
| s | 94411 | 5.6% |
| o | 91798 | 5.4% |
| r | 83088 | 4.9% |
| l | 67749 | 4.0% |
| Other values (42) | 433765 |
verified_status
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| not verified | |
|---|---|
| verified | 1240 |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 11.744092 |
| Min length | 8 |
Characters and Unicode
| Total characters | 227624 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | not verified |
|---|---|
| 2nd row | not verified |
| 3rd row | not verified |
| 4th row | not verified |
| 5th row | not verified |
Common Values
| Value | Count | Frequency (%) |
| not verified | 18142 | |
| verified | 1240 | 6.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| verified | 19382 | |
| not | 18142 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 38764 | |
| i | 38764 | |
| v | 19382 | |
| r | 19382 | |
| f | 19382 | |
| d | 19382 | |
| n | 18142 | |
| o | 18142 | |
| t | 18142 | |
| 18142 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 227624 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 38764 | |
| i | 38764 | |
| v | 19382 | |
| r | 19382 | |
| f | 19382 | |
| d | 19382 | |
| n | 18142 | |
| o | 18142 | |
| t | 18142 | |
| 18142 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 227624 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 38764 | |
| i | 38764 | |
| v | 19382 | |
| r | 19382 | |
| f | 19382 | |
| d | 19382 | |
| n | 18142 | |
| o | 18142 | |
| t | 18142 | |
| 18142 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 227624 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 38764 | |
| i | 38764 | |
| v | 19382 | |
| r | 19382 | |
| f | 19382 | |
| d | 19382 | |
| n | 18142 | |
| o | 18142 | |
| t | 18142 | |
| 18142 |
author_ban_status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
| active | |
|---|---|
| under review | |
| banned |
Length
| Max length | 12 |
|---|---|
| Median length | 6 |
| Mean length | 6.6438964 |
| Min length | 6 |
Characters and Unicode
| Total characters | 128772 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | under review |
|---|---|
| 2nd row | active |
| 3rd row | active |
| 4th row | active |
| 5th row | active |
Common Values
| Value | Count | Frequency (%) |
| active | 15663 | |
| under review | 2080 | 10.7% |
| banned | 1639 | 8.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| active | 15663 | |
| under | 2080 | 9.7% |
| review | 2080 | 9.7% |
| banned | 1639 | 7.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 23542 | |
| i | 17743 | |
| v | 17743 | |
| a | 17302 | |
| c | 15663 | |
| t | 15663 | |
| n | 5358 | 4.2% |
| r | 4160 | 3.2% |
| d | 3719 | 2.9% |
| u | 2080 | 1.6% |
| Other values (3) | 5799 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 128772 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 23542 | |
| i | 17743 | |
| v | 17743 | |
| a | 17302 | |
| c | 15663 | |
| t | 15663 | |
| n | 5358 | 4.2% |
| r | 4160 | 3.2% |
| d | 3719 | 2.9% |
| u | 2080 | 1.6% |
| Other values (3) | 5799 | 4.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 128772 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 23542 | |
| i | 17743 | |
| v | 17743 | |
| a | 17302 | |
| c | 15663 | |
| t | 15663 | |
| n | 5358 | 4.2% |
| r | 4160 | 3.2% |
| d | 3719 | 2.9% |
| u | 2080 | 1.6% |
| Other values (3) | 5799 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 128772 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 23542 | |
| i | 17743 | |
| v | 17743 | |
| a | 17302 | |
| c | 15663 | |
| t | 15663 | |
| n | 5358 | 4.2% |
| r | 4160 | 3.2% |
| d | 3719 | 2.9% |
| u | 2080 | 1.6% |
| Other values (3) | 5799 | 4.5% |
video_view_count
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 15632 |
|---|---|
| Distinct (%) | 81.9% |
| Missing | 298 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 254708.56 |
| Minimum | 20 |
|---|---|
| Maximum | 999817 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 151.6 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 936 |
| Q1 | 4942.5 |
| median | 9954.5 |
| Q3 | 504327 |
| 95-th percentile | 903662.8 |
| Maximum | 999817 |
| Range | 999797 |
| Interquartile range (IQR) | 499384.5 |
Descriptive statistics
| Standard deviation | 322893.28 |
|---|---|
| Coefficient of variation (CV) | 1.267697 |
| Kurtosis | -0.63510685 |
| Mean | 254708.56 |
| Median Absolute Deviation (MAD) | 9829.5 |
| Skewness | 0.92846093 |
| Sum | 4.8608581 × 109 |
| Variance | 1.0426007 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7393 | 7 | < 0.1% |
| 2562 | 7 | < 0.1% |
| 3184 | 6 | < 0.1% |
| 2030 | 6 | < 0.1% |
| 4081 | 6 | < 0.1% |
| 8997 | 5 | < 0.1% |
| 2873 | 5 | < 0.1% |
| 639 | 5 | < 0.1% |
| 1122 | 5 | < 0.1% |
| 2252 | 5 | < 0.1% |
| Other values (15622) | 19027 | |
| (Missing) | 298 | 1.5% |
| Value | Count | Frequency (%) |
| 20 | 2 | |
| 22 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 31 | 2 | |
| 35 | 1 | < 0.1% |
| 37 | 3 |
| Value | Count | Frequency (%) |
| 999817 | 1 | |
| 999673 | 1 | |
| 999655 | 1 | |
| 999653 | 1 | |
| 999446 | 1 | |
| 999346 | 1 | |
| 999132 | 1 | |
| 999127 | 1 | |
| 999082 | 1 | |
| 998911 | 1 |
video_like_count
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 12224 |
|---|---|
| Distinct (%) | 64.1% |
| Missing | 298 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84304.636 |
| Minimum | 0 |
|---|---|
| Maximum | 657830 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 151.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 83.15 |
| Q1 | 810.75 |
| median | 3403.5 |
| Q3 | 125020 |
| 95-th percentile | 393957.3 |
| Maximum | 657830 |
| Range | 657830 |
| Interquartile range (IQR) | 124209.25 |
Descriptive statistics
| Standard deviation | 133420.55 |
|---|---|
| Coefficient of variation (CV) | 1.5826004 |
| Kurtosis | 2.4901651 |
| Mean | 84304.636 |
| Median Absolute Deviation (MAD) | 3365.5 |
| Skewness | 1.7868774 |
| Sum | 1.6088697 × 109 |
| Variance | 1.7801042 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 32 | 22 | 0.1% |
| 19 | 19 | 0.1% |
| 10 | 18 | 0.1% |
| 9 | 18 | 0.1% |
| 6 | 18 | 0.1% |
| 12 | 17 | 0.1% |
| 60 | 17 | 0.1% |
| 4 | 17 | 0.1% |
| 5 | 17 | 0.1% |
| 43 | 16 | 0.1% |
| Other values (12214) | 18905 | |
| (Missing) | 298 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 4 | < 0.1% |
| 1 | 16 | |
| 2 | 16 | |
| 3 | 16 | |
| 4 | 17 | |
| 5 | 17 | |
| 6 | 18 | |
| 7 | 13 | |
| 8 | 11 | |
| 9 | 18 |
| Value | Count | Frequency (%) |
| 657830 | 1 | |
| 656243 | 1 | |
| 654588 | 1 | |
| 653561 | 1 | |
| 649695 | 1 | |
| 648101 | 1 | |
| 647236 | 1 | |
| 639877 | 1 | |
| 636812 | 1 | |
| 635335 | 1 |
video_share_count
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 9231 |
|---|---|
| Distinct (%) | 48.4% |
| Missing | 298 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16735.248 |
| Minimum | 0 |
|---|---|
| Maximum | 256130 |
| Zeros | 99 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 151.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 115 |
| median | 717 |
| Q3 | 18222 |
| 95-th percentile | 89017.5 |
| Maximum | 256130 |
| Range | 256130 |
| Interquartile range (IQR) | 18107 |
Descriptive statistics
| Standard deviation | 32036.174 |
|---|---|
| Coefficient of variation (CV) | 1.9142933 |
| Kurtosis | 8.3386485 |
| Mean | 16735.248 |
| Median Absolute Deviation (MAD) | 709 |
| Skewness | 2.7226563 |
| Sum | 3.1937548 × 108 |
| Variance | 1.0263165 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 147 | 0.8% |
| 1 | 144 | 0.7% |
| 3 | 122 | 0.6% |
| 5 | 109 | 0.6% |
| 0 | 99 | 0.5% |
| 4 | 95 | 0.5% |
| 8 | 95 | 0.5% |
| 6 | 90 | 0.5% |
| 12 | 76 | 0.4% |
| 9 | 76 | 0.4% |
| Other values (9221) | 18031 | |
| (Missing) | 298 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 99 | |
| 1 | 144 | |
| 2 | 147 | |
| 3 | 122 | |
| 4 | 95 | |
| 5 | 109 | |
| 6 | 90 | |
| 7 | 72 | |
| 8 | 95 | |
| 9 | 76 |
| Value | Count | Frequency (%) |
| 256130 | 1 | |
| 249672 | 1 | |
| 241010 | 1 | |
| 240154 | 1 | |
| 238004 | 1 | |
| 234618 | 1 | |
| 222848 | 1 | |
| 220745 | 1 | |
| 219917 | 1 | |
| 217265 | 1 |
video_download_count
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 4336 |
|---|---|
| Distinct (%) | 22.7% |
| Missing | 298 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1049.4296 |
| Minimum | 0 |
|---|---|
| Maximum | 14994 |
| Zeros | 977 |
| Zeros (%) | 5.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 151.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7 |
| median | 46 |
| Q3 | 1156.25 |
| 95-th percentile | 5547.85 |
| Maximum | 14994 |
| Range | 14994 |
| Interquartile range (IQR) | 1149.25 |
Descriptive statistics
| Standard deviation | 2004.2999 |
|---|---|
| Coefficient of variation (CV) | 1.9098945 |
| Kurtosis | 8.4357775 |
| Mean | 1049.4296 |
| Median Absolute Deviation (MAD) | 46 |
| Skewness | 2.7361623 |
| Sum | 20027315 |
| Variance | 4017218.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 977 | 5.0% |
| 1 | 972 | 5.0% |
| 2 | 704 | 3.6% |
| 3 | 634 | 3.3% |
| 4 | 556 | 2.9% |
| 5 | 402 | 2.1% |
| 6 | 381 | 2.0% |
| 7 | 315 | 1.6% |
| 8 | 303 | 1.6% |
| 9 | 283 | 1.5% |
| Other values (4326) | 13557 | |
| (Missing) | 298 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 977 | |
| 1 | 972 | |
| 2 | 704 | |
| 3 | 634 | |
| 4 | 556 | |
| 5 | 402 | |
| 6 | 381 | 2.0% |
| 7 | 315 | 1.6% |
| 8 | 303 | 1.6% |
| 9 | 283 | 1.5% |
| Value | Count | Frequency (%) |
| 14994 | 1 | |
| 14417 | 1 | |
| 14308 | 1 | |
| 14146 | 1 | |
| 14044 | 1 | |
| 13954 | 1 | |
| 13859 | 1 | |
| 13771 | 1 | |
| 13653 | 2 | |
| 13552 | 1 |
video_comment_count
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 2424 |
|---|---|
| Distinct (%) | 12.7% |
| Missing | 298 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 349.31215 |
| Minimum | 0 |
|---|---|
| Maximum | 9599 |
| Zeros | 3434 |
| Zeros (%) | 17.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 151.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 9 |
| Q3 | 292 |
| 95-th percentile | 1921 |
| Maximum | 9599 |
| Range | 9599 |
| Interquartile range (IQR) | 291 |
Descriptive statistics
| Standard deviation | 799.63886 |
|---|---|
| Coefficient of variation (CV) | 2.2891814 |
| Kurtosis | 19.711106 |
| Mean | 349.31215 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 3.8946817 |
| Sum | 6666273 |
| Variance | 639422.31 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3434 | 17.7% |
| 1 | 2222 | 11.5% |
| 2 | 1097 | 5.7% |
| 3 | 788 | 4.1% |
| 4 | 545 | 2.8% |
| 5 | 432 | 2.2% |
| 6 | 320 | 1.7% |
| 7 | 272 | 1.4% |
| 8 | 236 | 1.2% |
| 9 | 203 | 1.0% |
| Other values (2414) | 9535 | |
| (Missing) | 298 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 3434 | |
| 1 | 2222 | |
| 2 | 1097 | 5.7% |
| 3 | 788 | 4.1% |
| 4 | 545 | 2.8% |
| 5 | 432 | 2.2% |
| 6 | 320 | 1.7% |
| 7 | 272 | 1.4% |
| 8 | 236 | 1.2% |
| 9 | 203 | 1.0% |
| Value | Count | Frequency (%) |
| 9599 | 1 | |
| 8674 | 1 | |
| 8481 | 1 | |
| 8470 | 1 | |
| 7819 | 1 | |
| 7767 | 1 | |
| 7694 | 1 | |
| 7605 | 1 | |
| 7458 | 1 | |
| 7411 | 1 |
| # | author_ban_status | claim_status | verified_status | video_comment_count | video_download_count | video_duration_sec | video_id | video_like_count | video_share_count | video_view_count | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| # | 1.000 | 0.223 | 0.991 | 0.169 | -0.713 | -0.717 | -0.000 | -0.002 | -0.737 | -0.719 | -0.745 |
| author_ban_status | 0.223 | 1.000 | 0.316 | 0.060 | 0.080 | 0.121 | 0.012 | 0.000 | 0.162 | 0.120 | 0.204 |
| claim_status | 0.991 | 0.316 | 1.000 | 0.170 | 0.359 | 0.524 | 0.015 | 0.000 | 0.704 | 0.507 | 0.900 |
| verified_status | 0.169 | 0.060 | 0.170 | 1.000 | 0.059 | 0.091 | 0.000 | 0.007 | 0.118 | 0.077 | 0.152 |
| video_comment_count | -0.713 | 0.080 | 0.359 | 0.059 | 1.000 | 0.951 | -0.006 | 0.006 | 0.900 | 0.857 | 0.835 |
| video_download_count | -0.717 | 0.121 | 0.524 | 0.091 | 0.951 | 1.000 | 0.006 | 0.006 | 0.939 | 0.890 | 0.862 |
| video_duration_sec | -0.000 | 0.012 | 0.015 | 0.000 | -0.006 | 0.006 | 1.000 | 0.009 | 0.005 | 0.005 | 0.003 |
| video_id | -0.002 | 0.000 | 0.000 | 0.007 | 0.006 | 0.006 | 0.009 | 1.000 | 0.005 | 0.001 | 0.004 |
| video_like_count | -0.737 | 0.162 | 0.704 | 0.118 | 0.900 | 0.939 | 0.005 | 0.005 | 1.000 | 0.940 | 0.910 |
| video_share_count | -0.719 | 0.120 | 0.507 | 0.077 | 0.857 | 0.890 | 0.005 | 0.001 | 0.940 | 1.000 | 0.864 |
| video_view_count | -0.745 | 0.204 | 0.900 | 0.152 | 0.835 | 0.862 | 0.003 | 0.004 | 0.910 | 0.864 | 1.000 |
| # | claim_status | video_id | video_duration_sec | video_transcription_text | verified_status | author_ban_status | video_view_count | video_like_count | video_share_count | video_download_count | video_comment_count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | claim | 7017666017 | 59 | someone shared with me that drone deliveries are already happening and will become common by 2025 | not verified | under review | 343296.0 | 19425.0 | 241.0 | 1.0 | 0.0 |
| 1 | 2 | claim | 4014381136 | 32 | someone shared with me that there are more microorganisms in one teaspoon of soil than people on the planet | not verified | active | 140877.0 | 77355.0 | 19034.0 | 1161.0 | 684.0 |
| 2 | 3 | claim | 9859838091 | 31 | someone shared with me that american industrialist andrew carnegie had a net worth of $475 million usd, worth over $300 billion usd today | not verified | active | 902185.0 | 97690.0 | 2858.0 | 833.0 | 329.0 |
| 3 | 4 | claim | 1866847991 | 25 | someone shared with me that the metro of st. petersburg, with an average depth of hundred meters, is the deepest metro in the world | not verified | active | 437506.0 | 239954.0 | 34812.0 | 1234.0 | 584.0 |
| 4 | 5 | claim | 7105231098 | 19 | someone shared with me that the number of businesses allowing employees to bring pets to the workplace has grown by 6% worldwide | not verified | active | 56167.0 | 34987.0 | 4110.0 | 547.0 | 152.0 |
| 5 | 6 | claim | 8972200955 | 35 | someone shared with me that gross domestic product (gdp) is the best financial indicator of a country's overall trade potential | not verified | under review | 336647.0 | 175546.0 | 62303.0 | 4293.0 | 1857.0 |
| 6 | 7 | claim | 4958886992 | 16 | someone shared with me that elvis presley has sold more records than the music band the beatles | not verified | active | 750345.0 | 486192.0 | 193911.0 | 8616.0 | 5446.0 |
| 7 | 8 | claim | 2270982263 | 41 | someone shared with me that the best selling single of all time is "white christmas" by bing crosby | not verified | active | 547532.0 | 1072.0 | 50.0 | 22.0 | 11.0 |
| 8 | 9 | claim | 5235769692 | 50 | someone shared with me that about half of the world's population can access the web via a mobile device | not verified | active | 24819.0 | 10160.0 | 1050.0 | 53.0 | 27.0 |
| 9 | 10 | claim | 4660861094 | 45 | someone shared with me that it would take a 50 petabyte drive to store every written work ever created | verified | active | 931587.0 | 171051.0 | 67739.0 | 4104.0 | 2540.0 |
| # | claim_status | video_id | video_duration_sec | video_transcription_text | verified_status | author_ban_status | video_view_count | video_like_count | video_share_count | video_download_count | video_comment_count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 19372 | 19373 | NaN | 5731766527 | 16 | NaN | not verified | active | NaN | NaN | NaN | NaN | NaN |
| 19373 | 19374 | NaN | 5731838072 | 46 | NaN | verified | active | NaN | NaN | NaN | NaN | NaN |
| 19374 | 19375 | NaN | 3559825127 | 42 | NaN | not verified | active | NaN | NaN | NaN | NaN | NaN |
| 19375 | 19376 | NaN | 2159797367 | 45 | NaN | verified | active | NaN | NaN | NaN | NaN | NaN |
| 19376 | 19377 | NaN | 4099538565 | 7 | NaN | not verified | active | NaN | NaN | NaN | NaN | NaN |
| 19377 | 19378 | NaN | 7578226840 | 21 | NaN | not verified | active | NaN | NaN | NaN | NaN | NaN |
| 19378 | 19379 | NaN | 6079236179 | 53 | NaN | not verified | active | NaN | NaN | NaN | NaN | NaN |
| 19379 | 19380 | NaN | 2565539685 | 10 | NaN | verified | under review | NaN | NaN | NaN | NaN | NaN |
| 19380 | 19381 | NaN | 2969178540 | 24 | NaN | not verified | active | NaN | NaN | NaN | NaN | NaN |
| 19381 | 19382 | NaN | 8132759688 | 13 | NaN | not verified | active | NaN | NaN | NaN | NaN | NaN |